perm filename CRITIQ[4,KMC]1 blob
sn#115497 filedate 1974-08-14 generic text, type T, neo UTF8
00100
00200 CRITICISMS OF PARRY
00300
00400 K.M. COLBY
00500
00600
00700 Much of the A-I community is now aware of our simulation of
00800 paranoid processes called PARRY. The model has been available for
00900 interviewing on the ARPA network and thousands of interviews have
01000 been conducted with several versions of the model. During the long
01100 period of development of the model we have been aware of the
01200 potential limitations of various alternative programming approaches
01300 to designing an algorithm capable of conducting useful non-trivial
01400 dialogue in natural language. Colleagues, assocoates, and students
01500 have volunteered a number of criticisms of the model. I would now
01600 like to answer them en bloc. Only serious, reasoned, and
01700 well-founded criticisms will be considered.
01800 Workers in A-I come from different intellectual traditions.
01900 This influences what people want from a model and would like to see
02000 in a model. Those from mathematical and logical backgrounds like to
02100 see lots of deductive inference; those from physics and chemistry
02200 like to see laws; those from the life sciences like to see
02300 complexity, growth, and development represented in a model. It is
02400 important that we recognize and respect the traditions and
02500 philosophies of both the demonstrative and empirical sciences. Those
02600 raised on a Euclidean model of knowledge seek to understand phenomena
02700 using a few definitions and axioms, a few rules of inference, long
02800 chains of inference, and deductive consistency. Some aspects of
02900 experience yield to this approach but many, especially in the case of
03000 biological organisms, do not.
03100 Everyone realizes that a model represents a simplification
03200 and an idealization. In constructing a model, only a few variables
03300 are selected as centrally relevant while the rest are neglected as
03400 econdary or unknown. Only a few relations between the relevant
03500 variables are introduced. Thus a model does not match exactly that
03600 which it models in all details. It is partial in that only some
03700 aspects of the referent system are represented and it is an
03800 approximation in that it is limited in depth and not free of error. A
03900 model is an idealization in that it may utilize abstractions and it
04000 may possess perfect properties known to be lacking in its natural
04100 counterpart. Thus the model's knowledge is not as extensive as that
04200 of a person and it possesses a perfect memory unencumbered by
04300 inhibitory processes. We can allow ourselves this idealization of
04400 perfect memory because we are not studying, for example, memory
04500 decay. since we do not consider it to be a pertinent variable in
04600 paranoid processes.
04700 I shall list the major criticisms of the model which have
04800 come to our attention and attempt replies to each.
04900
05000 CRITICISM #1:
05100 PARRY is simply a stimulus-response model. It recognizes
05200 something in the input and then just responds to it without
05300 "thinking" or inferring. The model should interpret what it sees
05400 and engage in more computation than execution of a simple rewrite or
05500 production rule. REPLY:
05600 It is true that in early versions of the model many of the
05700 responses consisted of simple rewrites, e.g. when the input
05800 consisted of "Hello", the output response was "Hi" and no rules other
05900 than of the type "see x, say y" were involved. But as we began to
06000 improve and extend the model, this type of response disappeared. In
06100 the current version, the model consists of two modules, one for
06200 recognition of natural language expressions and one for response.
06300 Once the recognizer decides what is being said, the response module,
06400 using a number of tests and rules, decides how to respond. Thus a
06500 "Hello" no longer receives an automatic "Hi" but may receive a
06600 variety of responses depending on a large number of conditions,
06700 including a "model" of the interviewer which PARRY builds up during
06800 the interview. This representation of the interviewer involves
06900 making inferences about his competence, his helpfulness, etc.
07000
07100 CRITICISM #2:
07200 PARRY's performance constitutes an illusion. The model's
07300 data-base knowledge is too limited to represent adequately all that a
07400 person knows. Because the model can answer a few questions well,
07500 people,because of their own expectations, are easily fooled into
07600 believing PARRY is capable of answering the great variety of
07700 questions a person is capable of answering. People will assume there
07800 is much more there than there really is. Thus PARRY is a fake, a
07900 mirage, a conjurer's trick in which the audience is led to believe
08000 something is true when it is not.
08100
08200 REPLY:
08300 One of Descartes' tests for distinguishing man from machines
08400 was that the latter "did not act from knowledge but only from the
08500 disposition of their organs". (Descartes' other test concerned
08600 linguistic variety). Granted that a model of a psychological process
08700 should contain knowledge, the questions become, how much knowledge
08800 and how is it to be represented?
08900 Since a model is a simplification, it has boundary
09000 conditions. A model of a paranoid patient is a model of being
09100 paranoid, being a patient, and being a person. PARRY does reasonably
09200 well in the first two of these "beings". It fails in the third
09300 because of limited knowledge. How can we decide what the model should
09400 know? It is theoretically trivial to add tomes of facts to the data
09500 base but this seems to be what some A-I critics want. The fact that
09600 PARRY can discuss some topics rather well indicates it is doing the
09700 right things in these domains and could do well in other domains that
09800 are functionally similar. Simply adding facts without improving the
09900 algorithm can lead to a degradation of performance asexperience with
10000 belief-system simulations and theorem-proving programs has shown.
10100 More important than sheer number of facts is how they are
10200 organized, how they are represented, and how they are handled by the
10300 processing rules to contribute to the charactersitic performance of
10400 the model. Some seem happy to know there are fixed propositions or
10500 "frames" in the data-base which can be consulted in answering
10600 questions. Even if a model can answer 50 questions about a topic
10700 using rewrite rules, some would say the model does not really "know"
10800 anything about the topic. The procedural- declarative argument has no
10900 end in sight. It seems to be a matter of personal style and
11000 efficiency.
11100 PARRY is not a literal copy of a total person. The test of
11200 adequacy here is not Turing's machine-question-"which is person and
11300 which is machine?" This is not a stringent test since the criteria
11400 for distinguishing what is human behavior over a teletype have not
11500 been systematically worked out, i.e. almost anything is accepted as
11600 being human. PARRY is not the real thing; it is a model, a
11700 simulation, an imitation, a mind-like artifact, an automaton,
11800 synthetic and artificial. The real thing, a living person, is
11900 characterized by such great logical complexity, inhomogeneity of
12000 class, and individuality that a strategy of simplification is called
12100 for.
12200 CRITICISM #3:
12300 PARRY models paranoid behavior without modelling the
12400 underlying mechanisms of paranoid processes. Because the I-O behavior
12500 of PARRY is indistinguishable from the I-O behavior of paranoid
12600 patients, it does not mean that the same mechanisms are involved.
12700
12800 REPLY:
12900 This is so true as to be an A-I truism. When the inner
13000 mechanisms of a system are inaccessible to observation, one must make
13100 plausible guesses as to what is going on. These guesses represent
13200 analogies. They are not to be taken as the "same" mechanisms. If we
13300 knew the " real" mechanisms, there would be no need to posit
13400 analogies about a hidden reality. We try to design structures to fill
13500 in the black box. Further empirical tests and experiments are
13600 necesary to increase the plausibility of the analogy proposed.
13700 Successful predictions and pragmatic usefulness increase the
13800 acceptability of the model to the relevant expert community or
13900 communities.
14000 We can never know with certainty whether a model is "true".
14100 If it is consistent with itself and with the data of observation,
14200 then it is valuable cognitively and pragmatically. Such coherence is
14300 not a definition of truth but a criterion for truth.
14400 An expert community has various criteria for acceptability of
14500 a model. Sometimes it is demanded that a model provide an
14600 explanation. What constitutes an explanation may range from
14700 describing causes to making intelligible the connections between
14800 input and output. An extreme view is that science does not explain
14900 anything; A is simply interpreted in terms of B and B in terms of C,
15000 etc.
15100 A pragmatic criterion for a model is whether it represents a
15200 a workable possibility? Can it be tested and measureably improved
15300 as a result of these tests? That is, is there an evaluation procedure
15400 for accumulative progress? In the case of PARRY, the answer to these
15500 questions is "yes".
15600
15700 CRITICISM #4:
15800
15900 PARRY is an ad hoc model. It is designed after the fact to
16000 fit a limited set of special cases and lacks generality.
16100
16200 REPLY:
16300 Sometimes this criticism is levelled at the
16400 language-recognition processes and sometimes at the scope of the
16500 model. The language recognizer of PARRY is a pattern-matcher. But the
16600 surface English input expressions are transformed into more abstract
16700 patterns which are matched against stored patterns. The many-to-one
16800 tranformation involves synonymic-translations and word-classes. Thus
16900 the language-recognizer has some generality in that these processes
17000 can be used by any "host" system which takes natural language input.
17100 It is true that PARRY is circumscribed. It "explains" the
17200 data it was designed to explain. One wants to achieve at least this
17300 degree of explanatory power in a model. But can it predict a new fact
17400 or fit a new fact discovered in some other way? This view sees ad
17500 hociness, not as a property of a model, but as a relation between two
17600 consecutive models or theories. Does PARRY have some novel
17700 consequence compared to its predecessor? One trouble is that
17800 predecessor formulations explaining paranoia have been so vaguely
17900 stated as to be untestable. The theory embodied in the model has
18000 novel consequences compared to other formulations.
18100 For example, the theory posits that the paranoid mode of
18200 thought involves symbol-processing strategies which attempt to
18300 forestall or minimize the affect of humiliation. A novel consequence
18400 of this theory is that if a person were desensitized to the negative
18500 affect of humiliation, he would be less prone to utilize the
18600 strategies of the paranoid mode.
18700
18800 CRITICISM #5:
18900 PARRY'S paranoid behavior is strictly the result of canned
19000 paranoid-like responses. Granted that PARRY is diagnosed as paranoid
19100 by expert judges, this diagnosis is not a consequence of the theory
19200 embodied in the model but is simply produced by the model's canned
19300 replies which are linguistically paranoid in nature.
19400
19500 REPLY:
19600
19700 This is a weighty criticism because it implies that the
19800 theory of humiliation and the rules of the model are excess baggage.
19900 The made-up output replies are so typical of paranoid verbal
20000 responses that they alone might be sufficient to simulate paranoid
20100 interactions.
20200 Given that a model had a list of paranoid-like responses, it
20300 would still need some mechanism or rules for selecting which response
20400 to output in reply to a specific input. Experiments have shown that
20500 random selection from this list results in an inadequate performance.
20600 For example, on a dimension of "thought disorder" on a 0-9 scale, (0
20700 means zero amount and 9 means a large amount), a random model
20800 received a mean rating of 5.94 from expert psychiatrists. Patients
20900 rated by the same judges received a mean rating of 2.99 whereas a
21000 version of PARRY was rated at 3.78.
21100 Little is known about how to generate surface English which
21200 is appropriate to the input and phrased in a characteristic style.
21300 Segment-by-segment generation or even word-by-word generation would
21400 be preferable to outputting canned sentences as long as the rules
21500 posited for the paranoid mode were somehow called into play in the
21600 generation process. (Fortunately no one has demanded that PARRY
21700 generate words letter-by-letter to account for alternative
21800 spellings). Since generation of natural language output represents
21900 one of the major shortcomings of the model, we are at present
22000 attempting to couple the generation more closely with the model's
22100 theory.
22200 CRITICISM #6:
22300 The model, even if successful, is useless. Does it teach us
22400 anything about paranoia? Can it be used to help patients suffering
22500 from paranoid disorders?
22600
22700 REPLY:
22800 The model represents an attempt to make intelligible paranoid
22900 processes in explicit symbol-processing terms. A model of
23000 psychopathology in which the mind is in error about some of its own
23100 processes has implications for prevention, reduction, and cure of
23200 disorder. PARRY intersects two expert communities consisting of
23300 researchers in artificial intelligence and clinicians in psychiatry.
23400 Clinicians are practical men who are interested in technological
23500 applications.
23600 If the disorder is at the "hardware" level of brain
23700 pathology, then the application of symbol-processing techniques might
23800 be of little use. But if there is reason to believe the disorder is
23900 at the program level of learned, acquired strategies, then attempts
24000 at re-programming through symbolic-semantic techniques are worth
24100 considering. At present clinicians have great difficulties treating
24200 paranoid disorders. For a clinician practicing behavior therapy, the
24300 model's theory suggests desensitizing the patient to humiliation, a
24400 technique which has been successful with other negative affects such
24500 as anxiety. For those practicing psychotherapy, the model's theory
24600 suggests exploring the topics of humiliation and self-censure in the
24700 hope of helping the patient to reject his judgements of himself as
24800 inadequate. Judging whether these treatments are effective would
24900 depend on clinical evaluations.
25000
25100 CRITICISM #7:
25200 PARRY does not tell us what is the cause of paranoid
25300 thinking. Efective treatment requires we know the cause of a
25400 disorder.
25500
25600 REPLY:
25700 PARRY does not account for how a system got to be that way;
25800 it describes only how the system now works. An ontogenetic or
25900 morphogenetic model would show how a normal system became that way as
26000 a result of its experience over time.
26100 It is not true that to have effective treatments one must
26200 know the cause of a disorder. Illnesses involve loops and circles
26300 which, if broken anywhere, can lead to relief of the disorder even
26400 when the mechanism of action of the treatment is not understood.
26500 Common examples of successful treatments for illnesses of unknown
26600 causes are insulin in diabetes, digitalis in congestive heart
26700 failure, colchicine in gout, and lithium in mania.
26800
26900 CRITICISM #8
27000
27100 The tests PARRY has passed are not severe enough. If a model
27200 passes a validation test, it might not be because it is a good model
27300 but because the test is weak.
27400
27500 REPLY:
27600 Our strongest test invloves having judges rate interviews
27700 with versions of the model and with paranoid patients. We utilize
27800 statistical measures to see how closely the model's performance
27900 matches that of the patients and how much better it performs than
28000 previous model-versions. A recent study showed that on the dimension
28100 of linguistic comprehension independent raters gave PARRY2 a mean
28200 rating of 5.48 on a scale of 0-9. A previous version of PARRY
28300 received a mean rating of 5.25. This improvement is significant at
28400 the 0.05% level. But the model is still far from the 7.42 rating
28500 received by the patients. The rating groups ( psychiatrists and
28600 graduate students ) have been shown to be reliable, i.e. there is
28700 agreement both within groups of raters and between groups.
28800 Stronger tests are certainly needed and we would welcome
28900 suggestions along these lines. Are there validation tests others have
29000 used which might be suitable for PARRY? In the past most models have
29100 relied on face validity. To improve a model measureably, we need
29200 better tests and statistical measures.
29300 CRITICISM #9:
29400
29500 PARRY is too sketchy, crude, and immature a model. Such
29600 theoretical models can be premature for a field and can turn out to
29700 be irrelevant or counterproductive. We should collect more data
29800 about naturally-occurring paranoia before attempting model
29900 construction.
30000
30100 REPLY:
30200
30300 No one really knows when to begin theorizing. Even facts are
30400 now believed to be heavily theory-laden, whether their collector
30500 realizes it or not. One of the perils of model building is that
30600 data used to test a model may demolish it. A model is only sufficient
30700 unto the day.
30800 If PARRY is not acceptable, then one accepts some rival
30900 formulation ( a current one is "paranoia represents the
31000 transformation of love into hate"), or one accepts nothing and waits.
31100 Waiting for perfection can be paralyzing to a field, ,especially one
31200 devoted to patients who need help.
31300 As a simplification, PARRY is perhaps too simple at the
31400 moment. In constructing a model, one strives for something simpler
31500 than the "real" referent system which is difficult to understand or
31600 manipulate. But one wants to retain the important features
31700 characteristic of the natural counterpart. If the model is too
31800 simple, it is unable to reproduce these important features and
31900 extrapolation to the natural referent system becomes risky. If the
32000 model is too complex, it becomes as difficult to understand and
32100 manipulate as the real thing. Faced with this dilemma, a model
32200 builder can improve his model by simplifying it or making it more
32300 complicated while retaining consistency.
32400 These points are discussed in greater detail in a forthcoming
32500 book entitled ARTIFICIAL PARANOIA: A Computer Simulation of Paranoid
32600 Processes to be published in a few months by Pergamon Press.